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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2049 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA (genomic) 

10 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E Virus (Burma strain) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

2 0 ATGCGCCCTC GGCCTATTTT GTTGCTGCTC CTCATGTTTT TGCCTATGCT GCCCGCGCCA 60 

CCGCCCGGTC AGCCGTCTGG CCGCCGTCGT GGGCGGCGCA GCGGCGGTTC CGGCGGTGGT 120 

TTCTGGGGTG ACCGGGTTGA TTCTCAGCCC TTCGCAATCC CCTATATTCA TCCAACCAAC 180 

25 

CCCTTCGCCC CCGATGTCAC CGCTGCGGCC GGGGCTGGAC CTCGTGTTCG CCAACCCGCC 240 

CGACCACTCG GCTCCGCTTG GCGTGACCAG GCCCAGCGCC CCGCCGTTGC CTCACGTCGT 300 

30 AGACCTACCA CAGCTGGGGC CGCGCCGCTA ACCGCGGTCG CTCCGGCCCA TGACACCCCG 360 

CCAGTGCCTG ATGTCGACTC CCGCGGCGCC ATCTTGCGCC GGCAGTATAA CCTATCAACA 420 

TCTCCCCTTA CCTCTTCCGT GGCCACCGGC ACTAACCTGG TTCTTTATGC CGCCCCTCTT 480 

35 

AGTCCGCTTT TACCCCTTCA GGACGGCACC AATACCCATA TAATGGCCAC GGAAGCTTCT 540 

AATTATGCCC AGTACCGGGT TGCCCGTGCC ACAATCCGTT ACCGCCCGCT GGTCCCCAAT 600 

40 GCTGTCGGCG GTTACGCCAT CTCCATCTCA TTCTGGCCAC AGACCACCAC CACCCCGACG 660 



10 



104 

TCCGTTGATA TGAATTCAAT AACCTCGACG GATGTTCGTA TTTTAGTCCA GCCCGGCAXA 720 

GCCTCTGAGC TTGTGATCCC AAGTGAGCGC CTACACTATC GTAACCAAGG CTGGCGCTCC 780 

GTCGAGACCT CTGGGGTGGC TGAGGAGGAG GCTACCTCTG GTCTTGTTAT GCTTTGCATA 840 

CATGGCTCAC TCGTAAATTC CTATACTAAT ACACCCTATA CCGGTGCCCT CGGGCTGTTG 900 

GACTTTGCCC TTGAGCTTGA GTTTCGCAAC CTTACCCCCG GTAACACCAA TACGCGGGTC 960 

TCCCGTTATT CCAGCACTGC TCGCCACCGC CTTCGTCGCG GTGCGGACGG GACTGCCGAG 1020 

CTCACCACCA CGGCTGCTAC CCGCTTTATG AAGGACCTCT ATTTTACTAG TACTAATGGT 1080 

15 GTCGGTGAGA TCGGCCGCGG GATAGCCCTC ACCCTGTTCA ACCTTGCTGA CACTCTGCTT 1140 

GGCGGCCTGC CGACAGAATT GATTTCGTCG GCTGGTGGCC AGCTGTTCTA CTCCCGTCCC 1200 

GTTGTCTCAG CCAATGGCGA GCCGACTGTT AAGTTGTATA CATCTGTAGA GAATGCTCAG 1260 

20 

CAGGATAAGG GTATTGCAAT CCCGCATGAC ATTGACCTCG GAGAATCTCG TGTGGTTATT 1320 

CAGGATTATG ATAACCAACA TGAACAAGAT CGGCCGACGC CTTCTCCAGC CCCATCGCGC 1380 

2 5 CCTTTCTCTG TCCTTCGAGC TAATGATGTG CTTTGGCTCT CTCTCACCGC TGCCGAGTAT 1440 

GACCAGTCCA CTTATGGCTC TTCGACTGGC CCAGTTTATG TTTCTGACTC TGTGACCTTG 1500 

GTTAATGTTG CGACCGGCGC GCAGGCCGTT GCCCGGTCGC TCGATTGGAC CAAGGTCACA 1560 

30 

CTTGACGGTC GCCCCCTCTC CACCATCCAG CAGTACTCGA AGACCTTCTT TGTCCTGCCG 1620 

CTCCGCGGTA AGCTCTCTTT CTGGGAGGCA GGCACAACTA AAGCCGGGTA CCCTTATAAT 1680 

35 TATAACACCA CTGCTAGCGA CCAACTGCTT GTCGAGAATG CCGCCGGGCA CCGGGTCGCT 1740 

ATTTCCACTT ACACCACTAG CCTGGGTGCT GGTCCCGTCT CCATTTCTGC GGTTGCCGTT 1800 



40 



TTAGCCCCCC ACTCTGCGCT AGCATTGCTT GAGGATACCT TGGACTACCC TGCCCGCGCC 1860 
CATACTTTTG ATGATTTCTG CCCAGAGTGC CGCCCCCTTG GCCTTCAGGG CTGCGCTTTC 1920 



105 



CAGTCTACTG TCGCTGAGCT TCAGCGCCTT AAGATGAAGG TGGGTAAAAC TCGGGAGTTG 1980 
TAGTTTATTT GCTTGTGCCC CCCTTCTTTC TGTTGCTTAT TTCTCATTTC TGCGTTCCGC 2040 
5 GCTCCCTGA 2049 



(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS : 
10 (A) LENGTH: 2058 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 
20 (C) INDIVIDUAL ISOLATE: Hepatitis E Virus (Mexico Strain) 

ORF-2 region 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

25 ATGCGCCCTA GGCCTCTTTT GCTGTTGTTC CTCTTGTTTC TGCCTATGTT GCCCGCGCCA 60 

CCGACCGGTC AGCCGTCTGG CCGCCGTCGT GGGCGGCGCA GCGGCGGTAC CGGCGGTGGT 120 

TTCTGGGGTG ACCGGGTTGA TTCTCAGCCC TTCGCAATCC CCTATATTCA TCCAACCAAC 180 

30 

CCCTTTGCCC CAGACGTTGC CGCTGCGTCC GGGTCTGGAC CTCGCCTTCG CCAACCAGCC 240 



CGGCCACTTG GCTCCACTTG GCGAGATCAG GCCCAGCGCC CCTCCGCTGC CTCCCGTCGC 300 
35 CGACCTGCCA CAGCCGGGGC TGCGGCGCTG ACGGCTGTGG CGCCTGCCCA TGACACCTCA 360 

CCCGTCCCGG ACGTTGATTC TCGCGGTGCA ATTCTACGCC GCCAGTATAA TTTGTCTACT 420 



40 



TCACCCCTGA CATCCTCTGT GGCCTCTGGC ACTAATTTAG TCCTGTATGC AGCCCCCCTT 
AATCCGCCTC TGCCGCTGCA GGACGGTACT AATACTCACA TTATGGCCAC AGAGGCCTCC 



480 
540 



AATTATGCAC AGTACCGGGT TGCCCGCGCT 
GCAGTTGGAG GCTATGCTAT ATCCATTTCT 
5 TCTGTTGACA TGAATTCCAT TACTTCCACT 

GCATCTGAAT TGGTCATCCC AAGCGAGCGC 
GTTGAGACAT CTGGTGTTGC TGAGGAGGAA 

10 

CATGGCTCTC CAGTTAACTC CTATACCAAT 
GACTTTGCCT TAGAGCTTGA GTTTCGCAAT 
15 TCCCGTTACT CCAGCACGGC CCGTCACCGG 

CTGACCACAA CTGCAGCCAC CAGGTTCATG 
GTAGGTGAAG TCGGCCGCGG GATAGCTCTA 

20 

GGCGGGCTCC CGACAGAATT AATTTCGTCG 
GTTGTCTCAG CCAATGGCGA GCCAACCGTG 

2 5 CAGGATAAGG GTGTTGCTAT CCCCCACGAT 

CAGGATTATG ACAACCAGCA TGAGCAGGAT 
CCTTTTTCTG TTCTCCGAGC AAATGATGTA 

30 

GACCAGTCCA CTTACGGGTC GTCAACTGGC 
GTGAATGTTG CGACTGGCGC GCAGGCCGTA 

3 5 CTCGACGGGC GGCCCCTCCC GACTGTTGAG 

CTTCGTGGCA AGCTCTCCTT TTGGGAGGCC 
TATAATACTA CTGCTAGTGA CCAGATTCTG 

40 

ATTTCAACCT ATACCACCAG GCTTGGGGCC 
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ACTATCCGTT ACCGGCCCCT AGTGCCTAAT 600 

TTCTGGCCTC AAACAACCAC AACCCCTACA 660 

GATGTCAGGA TTCTTGTTCA ACCTGGCATA 720 

CTTCACTACC GCAATCAAGG TTGGCGCTCG 780 

GCCACCTCCG GTCTTGTCAT GTTATGCATA 840 

ACCCCTTATA CCGGTGCCCT TGGCTTACTG 900 

CTCACCACCT GTAACACCAA TACACGTGTG 960 

CTCCGCCGAG GGGCCGACGG GACTGCGGAG 1020 

AAAGATCTCC ACTTTACCGG CCTTAATGGG 1080 

ACATTACTTA ACCTTGCTGA CACGCTCCTC 1140 

GCTGGCGGGC AACTGTTTTA TTCCCGCCCG 1200 

AAGCTCTATA CATCAGTGGA GAATGCTCAG 1260 

ATCGATCTTG GTGATTCGCG TGTGGTCATT 1320 

CGGCCCACCC CGTCGCCTGC GCCATCTCGG 1380 

CTTTGGCTGT CCCTCACTGC AGCCGAGTAT 1440 

CCGGTTTATA TCTCGGACAG CGTGACTTTG 1500 

GCCCGATCGC TTGACTGGTC CAAAGTCACC 1560 

CAATATTCCA AGACATTCTT TGTGCTCCCC 1620 

GGCACAACAA AAGCAGGTTA TCCTTATAAT 1680 

ATTGAAAATG CTGCCGGCCA TCGGGTCGCC 1740 

GGTCCGGTCG CCATTTCTGC GGCCGCGGTT 1800 



10 



20 
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TTGGCTCCAC GCTCCGCCCT GGCTCTGCTG GAGGATACTT TTGATTATCC GGGGCGGGCG 1860 

CACACATTTG ATGACTTCTG CCCTGAATGC CGCGCTTTAG GCCTCCAGGG TTGTGCTTTC 1920 

CAGTCAACTG TCGCTGAGCT CCAGCGCCTT AAAGTTAAGG TGGGTAAAAC TCGGGAGTTG 1980 

TAGTTTATTT GGCTGTGCCC ACCTACTTAT ATCTGCTGAT TTCCTTTATT TCCTTTTTCT 2040 

CGGTCCCGCG CTCCCTGA 2058 
(2) INFORMATION FOR SEQ ID NO: 3: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1647 base pairs 
15 { B) TYPE: nucleic acid 

(C) STRANDED NESS : double 

(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E virus (Burma) r62kDa, 
2 5 FIGURE 2 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

3 0 GCGGTCGCTC CGGCCCATGA CACCCCGCCA GTGCCTGATG TCGACTCCCG CGGCGCCATC 60 

TTGCGCCGGC AGTATAACCT ATCAACATCT CCCCTTACCT CTTCCGTGGC CACCGGCACT 120 

AACCTGGTTC TTTATGCCGC CCCTCTTAGT CCGCTTTTAC CCCTTCAGGA CGGCACCAAT 180 

35 

ACCCATATAA TGGCCACGGA AGCTTCTAAT TATGCCCAGT ACCGGGTTGC CCGTGCCACA 240 

ATCCGTTACC GCCCGCTGGT CCCCAATGCT GTCGGCGGTT ACGCCATCTC CATCTCATTC 300 

4 0 TGGCCACAGA CCACCACCAC CCCGACGTCC GTTGATATGA ATTCAATAAC CTCGACGGAT 360 



108 

GTTCGTATTT TAGTCCAGCC CGGCATAGCC TCTGAGCTTG TGATCCCAAG TGAGCGCCTA 420 

CACTATCGTA ACCAAGGCTG GCGCTCCGTC GAGACCTCTG GGGTGGCTGA GGAGGAGGCT 480 

ACCTCTGGTC TTGTTATGCT TTGCATACAT GGCTCACTCG TAAATTCCTA TACTAATACA 540 

CCCTATACCG GTGCCCTCGG GCTGTTGGAC TTTGCCCTTG AGCTTGAGTT TCGCAACCTT 600 

ACCCCCGGTA ACACCAATAC GCGGGTCTCC CGTTATTCCA GCACTGCTCG CCACCGCCTT 660 

CGTCGCGGTG CGGACGGGAC TGCCGAGCTC ACCACCACGG CTGCTACCCG CTTTATGAAG 720 

GACCTCTATT TTACTAGTAC TAATGGTGTC GGTGAGATCG GCCGCGGGAT AGCCCTCACC 780 

15 CTGTTCAACC TTGCTGACAC TCTGCTTGGC GGCCTGCCGA CAGAATTGAT TTCGTCGGCT 840 

GGTGGCCAGC TGTTCTACTC CCGTCCCGTT GTCTCAGCCA ATGGCGAGCC GACTGTTAAG 900 



10 



20 



TTGTATACAT CTGTAGAGAA TGCTCAGCAG GATAAGGGTA TTGCAATCCC GCATGACATT 960 



GACCTCGGAG AATCTCGTGT GGTTATTCAG GATTATGATA ACCAACATGA ACAAGATCGG 1020 

CCGACGCCTT CTCCAGCCCC ATCGCGCCCT TTCTCTGTCC TTCGAGCTAA TGATGTGCTT 1080 

2 5 TGGCTCTCTC TCACCGCTGC CGAGTATGAC CAGTCCACTT ATGGCTCTTC GACTGGCCCA 1140 

GTTTATGTTT CTGACTCTGT GACCTTGGTT AATGTTGCGA CCGGCGCGCA GGCCGTTGCC 1200 

CGGTCGCTCG ATTGGACCAA GGTCACACTT GACGGTCGCC CCCTCTCCAC CATCCAGCAG 1260 

30 

TACTCGAAGA CCTTCTTTGT CCTGCCGCTC CGCGGTAAGC TCTCTTTCTG GGAGGCAGGC 1320 

ACAACTAAAG CCGGGTACCC TTATAATTAT AACACCACTG CTAGCGACCA ACTGCTTGTC 1380 

3 5 GAGAATGCCG CCGGGCACCG GGTCGCTATT TCCACTTACA CCACTAGCCT GGGTGCTGGT 1440 

CCCGTCTCCA TTTCTGCGGT TGCCGTTTTA GCCCCCCACT CTGCGCTAGC ATTGCTTGAG 1500 

GATACCTTGG ACTACCCTGC CCGCGCCCAT ACTTTTGATG ATTTCTGCCC AGAGTGCCGC 1560 

CCCCTTGGCC TTCAGGGCTG CGCTTTCCAG TCTACTGTCG CTGAGCTTCA GCGCCTTAAG 1620 



40 



10 
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ATGAAGGTGG GTAAAACTCG GGAGTTG 1647 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1647 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

15 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E virus (Mexico strain) 
r62kDa, FIGURE 2 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GCTGTGGCGC CTGCCCATGA CACCTCACCC GTCCCGGACG TTGATTCTCG CGGTGCAATT 60 

CTACGCCGCC AGTATAATTT GTCTACTTCA CCCCTGACAT CCTCTGTGGC CTCTGGCACT 120 

2 5 AATTTAGTCC TGTATGCAGC CCCCCTTAAT CCGCCTCTGC CGCTGCAGGA CGGTACTAAT 180 

ACTCACATTA TGGCCACAGA GGCCTCCAAT TATGCACAGT ACCGGGTTGC CCGCGCTACT 240 

ATCCGTTACC GGCCCCTAGT GCCTAATGCA GTTGGAGGCT ATGCTATATC CATTTCTTTC 300 

30 

TGGCCTCAAA CAACCACAAC CCCTACATCT GTTGACATGA ATTCCATTAC TTCCACTGAT 360 

GTCAGGATTC TTGTTCAACC TGGCATAGCA TCTGAATTGG TCATCCCAAG CGAGCGCCTT 420 

3 5 CACTACCGCA ATCAAGGTTG GCGCTCGGTT GAGACATCTG GTGTTGCTGA GGAGGAAGCC 480 

ACCTCCGGTC TTGTCATGTT ATGCATACAT GGCTCTCCAG TTAACTCCTA TACCAATACC 540 



40 



CCTTATACCG GTGCCCTTGG CTTACTGGAC TTTGCCTTAG AGCTTGAGTT TCGCAATCTC 
ACCACCTGTA ACACCAATAC ACGTGTGTCC CGTTACTCCA GCACGGCCCG TCACCGGCTC 



600 
660 



10 
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CGCCGAGGGG CCGACGGGAC TGCGGAGCTG ACCACAACTG CAGCCACCAG GTTCATGAAA 720 

GATCTCCACT TTACCGGCCT TAATGGGGTA GGTGAAGTCG GCCGCGGGAT AGCTCTAACA 780 

TTACTTAACC TTGCTGACAC GCTCCTCGGC GGGCTCCCGA CAGAATTAAT TTCGTCGGCT 840 

GGCGGGCAAC TGTTTTATTC CCGCCCGGTT GTCTCAGCCA ATGGCGAGCC AACCGTGAAG 900 

CTCTATACAT CAGTGGAGAA TGCTCAGCAG GATAAGGGTG TTGCTATCCC CCACGATATC 960 

GATCTTGGTG ATTCGCGTGT GGTCATTCAG GATTATGACA ACCAGCATGA GCAGGAT CGG 1020 

CCCACCCCGT CGCCTGCGCC ATCTCGGCCT TTTTCTGTTC TCCGAGCAAA TGATGTACTT 1080 

15 TGGCTGTCCC TCACTGCAGC CGAGTATGAC CAGTCCACTT ACGGGTCGTC AACTGGCCCG 1140 

GTTTATATCT CGGACAGCGT GACTTTGGTG AATGTTGCGA CTGGCGCGCA GGCCGTAGCC 1200 

CGATCGCTTG ACTGGTCCAA AGTCACCCTC GACGGGCGGC CCCTCCCGAC TGTTGAGCAA 1260 

20 

TATTCCAAGA CATTCTTTGT GCTCCCCCTT CGTGGCAAGC TCTCCTTTTG GGAGGCCGGC 1320 

ACAACAAAAG CAGGTTATCC TTATAATTAT AATACTACTG CTAGTGACCA GATTCTGATT 1380 

2 5 GAAAATGCTG CCGGCCATCG GGTCGCCATT TCAACCXATA CCACCAGGCT TGGGGCCGGT 1440 

CCGGTCGCCA TTTCTGCGGC CGCGGTTTTG GCTCCACGCT CCGCCCTGGC TCTGCTGGAG 1500 

GATACTTTTG ATTATCCGGG GCGGGCGCAC ACATTTGATG ACTTCTGCCC TGAATGCCGC 1560 

30 

GCTTTAGGCC TCCAGGGTTG TGCTTTCCAG TCAACTGTCG CTGAGCTCCA GCGCCTTAAA 1620 

GTTAAGGTGG GTAAAACTCG GGAGTTG 1647 

3 5 (2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 984 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: double 

{D ) TOPOLOGY: linear 



Ill 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

5 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E Virus (Burma strain) SG3 
region 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GGTGCGGACG GGACTGCCGA GCTCACCACC ACGGCTGCTA CCCGCTTTAT GAAGGACCTC 60 

TATTTTACTA GTACTAATGG TGTCGGTGAG ATCGGCCGCG GGATAGCCCT CACCCTGTTC 120 

15 

AACCTTGCTG ACACTCTGCT TGGCGGCCTG CCGACAGAAT TGATTTCGTC GGCTGGTGGC 180 

CAGCTGTTCT ACTCCCGTCC CGTTGTCTCA GCCAATGGCG AGCCGACTGT TAAGTTGTAT 240 

2 0 ACATCTGTAG AGAATGCTCA GCAGGATAAG GGTATTGCAA TCCCGCATGA CATTGACCTC 300 

GGAGAATCTC GTGTGGTTAT TCAGGATTAT GATAACCAAC ATGAACAAGA TCGGCCGACG 360 

CCTTCTCCAG CCCCATCGCG CCCTTTCTCT GTCCTTCGAG CTAATGATGT GCTTTGGCTC 420 

25 

TCTCTCACCG CTGCCGAGTA TGACCAGTCC ACTTATGGCT CTTCGACTGG CCCAGTTTAT 480 

GTTTCTGACT CTGTGACCTT GGTTAATGTT GCGACCGGCG CGCAGGCCGT TGCCCGGTCG 540 

3 0 CTCGATTGGA CCAAGGTCAC ACTTGACGGT CGCCCCCTCT CCACCATCCA GCAGTACTCG 600 

AAGACCTTCT TTGTCCTGCC GCTCCGCGGT AAGCTCTCTT TCTGGGAGGC AGGCACAACT 660 

AAAGCCGGGT ACCCTTATAA TTATAACACC ACTGCTAGCG ACCAACTGCT TGTCGAGAAT 720 

35 

GCCGCCGGGC ACCGGGTCGC TATTTCCACT TACACCACTA GCCTGGGTGC TGGTCCCGTC 780 

TCCATTTCTG CGGTTGCCGT TTTAGCCCCC CACTCTGCGC TAGCATTGCT TGAGGATACC 840 

4 0 TTGGACTACC CTGCCCGCGC CCATACTTTT GATGATTTCT GCCCAGAGTG CCGCCCCCTT 900 
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GGCCTTCAGG GCTGCGCTTT CCAGTCTACT GTCGCTGAGC TTCAGCGCCT TAAGATGAAG 960 
GTGGGTAAAA CTCGGGAGTT GTAG 934 
5 (2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 984 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

15 (iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

<C) INDIVIDUAL ISOLATE: Hepatits E Virus (Mexico strain) SG3 
region 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GGGGCCGACG GGACTGCGGA GCTGACCACA ACTGCAGCCA CCAGGTTCAT GAAAGATCTC 60 

25 

CACTTTACCG GCCTTAATGG GGTAGGTGAA GTCGGCCGCG GGATAGCTCT AACATTACTT 120 
AACCTTGCTG ACACGCTCCT CGGCGGGCTC CCGACAGAAT TAATTTCGTC GGCTGGCGGG 180 
3 0 CAACTGTTTT ATTCCCGCCC GGTTGTCTCA GCCAATGGCG AGCCAACCGT GAAGCTCTAT 240 

ACATCAGTGG AGAATGCTCA GCAGGATAAG GGTGTTGCTA TCCCCCACGA TATCGATCTT 300 
GGTGATTCGC GTGTGGTCAT TCAGGATTAT GACAACCAGC ATGAGCAGGA TCGGCCCACC 360 
CCGTCGCCTG CGCCATCTCG GCCTTTTTCT GTTCTCCGAG CAAATGATGT ACTTTGGCTG 420 
TCCCTCACTG CAGCCGAGTA TGACCAGTCC ACTTACGGGT CGTCAACTGG CCCGGTTTAT 480 



35 



40 



ATCTCGGACA GCGTGACTTT GGTGAATGTT GCGACTGGCG CGCAGGCCGT AGCCCGATCG 



540 



10 
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CTTGACTGGT CCAAAGTCAC CCTCGACGGG CGGCCCCTCC CGACTGTTGA GCAATATTCC 600 

AAGACATTCT TTGTGCTCCC CCTTCGTGGC AAGCTCTCCT TTTGGGAGGC CGGCACAACA 660 

AAAGCAGGTT ATCCTTATAA TTATAATACT ACTGCTAGTG ACCAGATTCT GATTGAAAAT 720 

GCTGCCGGCC ATCGGGTCGC CATTTCAACC TATACCACCA GGCTTGGGGC CGGTCCGGTC 780 

GCCATTTCTG CGGCCGCGGT TTTGGCTCCA CGCTCCGCCC TGGCTCTGCT GGAGGATACT 840 

TTTGATTATC CGGGGCGGGC GCACACATTT GATGACTTCT GCCCTGAATG CCGCGCTTTA 900 

GGCCTCCAGG GTTGTGCTTT CCAGTCAACT GTCGCTGAGC TCCAGCGCCT TAAAGTTAAG 960 

15 GTGGGTAAAA CTCGGGAGTT GTAG 984 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS; 
20 (A) LENGTH: 147 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

2 5 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

3 0 (C) INDIVIDUAL ISOLATE: Hepatitis E Virus (Burma strain) 

406*3-2 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

ACCTTGGACT ACCCTGCCCG CGCCCATACT TTTGATGATT TCTGCCCAGA GTGCCGCCCC 60 

CTTGGCCTTC AGGGCTGCGC TTTCCAGTCT ACTGTCGCTG AGCTTCAGCG CCTTAAGATG 120 

4 0 AAGGTGGGTA AAACTCGGGA GTTGTAG 147 



10 



25 



35 
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(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 147 base pairs 
{ B ) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E Virus (Mexico strain) 
15 406.3-2 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
2 0 ACTTTTGATT ATCCGGGGCG GGCGCACACA TTTGATGACT TCTGCCCTGA ATGCCGCGCT 60 

TTAGGCCTCC AGGGTTGTGC TTTCCAGTCA ACTGTCGCTG AGCTCCAGCG CCTTAAAGTT 120 
AAGGTGGGTA AAACTCGGGA GTTGTAG 147 
(2) INFORMATION FOR SEQ ID NO: 9: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 base pairs 
3 0 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E Virus (Burma Strain) 
4 0 ORF-3 region 



10 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

ATGAATAACA TGTCTTTTGC TGCGCCCATG GGTTCGCGAC CATGCGCCCT CGGCCTATTT 60 

TGTTGCTGCT CCTCATGTTT TTGCCTATGC TGCCCGCGCC ACCGCCCGGT CAGCCGTCTG 120 

GCCGCCGTCG TGGGCGGCGC AGCGGCGGTT CCGGCGGTGG TTTCTGGGGT GACCGGGTTG 180 

ATTCTCAGCC CTTCGCAATC CCCTATATTC ATCCAACCAA CCCCTTCGCC CCCGATGTCA 240 

CCGCTGCGGC CGGGGCTGGA CCTCGTGTTC GCCAACCCGC CCGACCACTC GGCTCCGCTT 300 

GGCGTGACCA GGCCCAGCGC CCCGCCGTTG CCTCACGTCG TAGACCTACC ACAGCTGGGG 360 

15 CCGCGCCGCT AA 372 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

2 0 (A) LENGTH: 372 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

<vi) ORIGINAL SOURCE: 

3 0 (C) INDIVIDUAL ISOLATE: Hepatitis E Virus (Mexico Strain) 

ORF-3 region 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
ATGAATAACA TGTGGTTTGC TGCGCCCATG GGTTCGCCAC CATGCGCCCT AGGCCTCTTT 60 
TGCTGTTGTT CCTCTTGTTT CTGCCTATGT TGCCCGCGCC ACCGACCGGT CAGCCGTCTG 120 



40 



GCCGCCGTCG TGGGCGGCGC AGCGGCGGTA CCGGCGGTGG TTTCTGGGGT GACCGGGTTG 180 



10 



20 



25 
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ATTCTCAGCC CTTCGCAATC CCCTATATTC ATCCAACCAA CCCCTTTGCC CCAGACGTTG 240 

CCGCTGCGTC CGGGTCTGGA CCTCGCCTTC GCCAACCAGC CCGGCCACTT GGCTCCACTT 300 

GGCGAGATCA GGCCCAGCGC CCCTCCGCTG CCTCCCGTCG CCGACCTGCC ACAGCCGGGG 360 

CTGCGGCGCT GA 372 
(2) INFORMATION FOR SEQ ID NO: 11: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E Virus (Burma strain) 
406.4-2 region 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GCCAACCCGC CCGACCACTC GGCTCCGCTT GGCGTGACCA GGCCCAGCGC CCCGCCGTTG 60 

3 0 CCTCACGTCG TAGACCTACC ACAGCTGGGG CCGCGCCGC 99 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS : 
35 (A) LENGTH: 99 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



40 



(ii) MOLECULE TYPE: DNA (genomic) 
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(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E Virus {Mexico strain) 
5 406.4-2 region 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
10 GCCAACCAGC CCGGCCACTT GGCTCCACTT GGCGAGATCA GGCCCAGCGC CCCTCCGCTG 60 

CCTCCCGTCG CCGACCTGCC ACAGCCGGGG CTGCGGCGC 99 
(2) INFORMATION FOR SEQ ID NO: 13: 

15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 660 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 

25 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E Virus (Burma strain) 

ORF-2 



3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Arg Pro Arg Pro lie Leu Leu Leu Leu Leu Met Phe Leu Pro Met 
15 10 15 

3 5 Leu Pro Ala Pro Pro Pro Gly Gin Pro Ser Gly Arg Arg Arg Gly Arg 

20 25 30 

Arg Ser Gly Gly Ser Gly Gly Gly Phe Trp Gly Asp Arg Val Asp Ser 
35 40 45 

40 



Gin Pro Phe Ala lie Pro Tyr lie His Pro Thr Asn Pro Phe Ala Pro 
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50 



55 



60 



Asp Val Thr Ala Ala Ala Gly Ala Gly Pro Arg Val Arg Gin Pro Ala 
65 70 75 80 

Arg Pro Leu Gly Ser Ala Trp Arg Asp Gin Ala Gin Arg Pro Ala Val 
85 90 95 



10 



Ala Ser Arg Arg Arg Pro Thr Thr Ala Gly Ala Ala Pro Leu Thr Ala 
100 105 no 



Val Ala Pro Ala His Asp Thr Pro Pro Val Pro Asp Val Asp Ser Arg 
115 120 125 



15 



Gly Ala He Leu Arg Arg Gin Tyr Asn Leu Ser Thr Ser Pro Leu Thr 
130 135 140 



20 



Ser Ser Val Ala Thr Gly Thr Asn Leu Val Leu Tyr Ala Ala Pro Leu 

14 $ 150 155 160 

Ser Pro Leu Leu Pro Leu Gin Asp Gly Thr Asn Thr His lie Met Ala 

165 170 175 



25 



Thr Glu Ala Ser Asn Tyr Ala Gin Tyr Arg Val Ala Arg Ala Thr He 
180 185 190 



Arg Tyr Arg Pro Leu Val Pro Asn Ala Val Gly Gly Tyr Ala He Ser 
195 200 205 



30 



He Ser Phe Trp Pro Gin Thr Thr Thr Thr Pro Thr Ser Val Asp Met 
210 215 220 



35 



Asn Ser He Thr Ser Thr Asp Val Arg He Leu Val Gin Pro Gly He 

225 230 235 240 

Ala Ser Glu Leu Val He Pro Ser Glu Arg Leu His Tyr Arg Asn Gin 

245 250 255 



40 



Gly Trp Arg Ser Val Glu Thr Ser Gly Val Ala Glu Glu Glu Ala Thr 
260 265 270 
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Ser Gly Leu Val Met Leu Cys lie His Gly Ser Leu Val Asn Ser Tyr 
275 280 285 

Thr Asn Thr Pro Tyr Thr Gly Ala Leu Gly Leu Leu Asp Phe Ala Leu 
5 290 295 300 

Glu Leu Glu Phe Arg Asn Leu Thr Pro Gly Asn Thr Asn Thr Arg Val 
305 310 315 320 

10 Ser Arg Tyr Ser Ser Thr Ala Arg His Arg Leu Arg Arg Gly Ala Asp 

325 330 335 

Gly Thr Ala Glu Leu Thr Thr Thr Ala Ala Thr Arg Phe Met Lys Asp 
340 345 350 

15 

Leu Tyr Phe Thr Ser Thr Asn Gly Val Gly Glu lie Gly Arg Gly lie 
355 360 365 

Ala Leu Thr Leu Phe Asn Leu Ala Asp Thr Leu Leu Gly Gly Leu Pro 
20 370 375 380 

Thr Glu Leu lie Ser Ser Ala Gly Gly Gin Leu Phe Tyr Ser Arg Pro 
385 390 395 400 

25 Val Val Ser Ala Asn Gly Glu Pro Thr Val Lys Leu Tyr Thr Ser Val 

405 410 415 

Glu Asn Ala Gin Gin Asp Lys Gly lie Ala lie Pro His Asp lie Asp 
420 425 430 



30 



Leu Gly Glu Ser Arg Val Val He Gin Asp Tyr Asp Asn Gin His Glu 
435 440 445 



Gin Asp Arg Pro Thr Pro Ser Pro Ala Pro Ser Arg Pro Phe Ser Val 

35 450 455 460 

Leu Arg Ala Asn Asp Val Leu Trp Leu Ser Leu Thr Ala Ala Glu Tyr 

465 470 475 480 



40 



Asp Gin Ser Thr Tyr Gly Ser Ser Thr Gly Pro Val Tyr Val Ser Asp 
485 490 495 
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Ser Val Thr Leu Val Asn Val Ala Thr Gly Ala Gin Ala Val Ala Arg 
500 505 510 

Ser Leu Asp Trp Thr Lys Val Thr Leu Asp Gly Arg Pro Leu Ser Thr 
5 515 520 525 

lie Gin Gin Tyr Ser Lys Thr Phe Phe Val Leu Pro Leu Arg Gly Lys 
530 535 540 

10 Leu Ser Phe Trp Glu Ala Gly Thr Thr Lys Ala Gly Tyr Pro Tyr Asn 

545 550 555 560 

Tyr Asn Thr Thr Ala Ser Asp Gin Leu Leu Val Glu Asn Ala Ala Gly 
565 570 575 

15 

His Arg Val Ala lie Ser Thr Tyr Thr Thr Ser Leu Gly Ala Gly Pro 
580 585 590 

Val Ser He Ser Ala Val Ala Val Leu Ala Pro His Ser Ala Leu Ala 
20 595 600 605 

Leu Leu Glu Asp Thr Leu Asp Tyr Pro Ala Arg Ala His Thr Phe Asp 
610 615 620 

25 Asp Phe Cys Pro Glu Cys Arg Pro Leu Gly Leu Gin Gly Cys Ala Phe 

625 630 635 640 

Gin Ser Thr Val Ala Glu Leu Gin Arg Leu Lys Met Lys Val Gly Lys 
645 650 655 



30 



35 



40 



Thr Arg Glu Leu 
660 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 660 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E Virus (Mexico Strain) 
5 ORF-2 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Arg Pro Arg Pro Leu Leu Leu Leu Phe Leu Leu Phe Leu Pro Met 
10 1 5 10 15 

Leu Pro Ala Pro Pro Thr Gly Gin Pro Ser Gly Arg Arg Arg Gly Arg 
20 25 30 

15 Arg Ser Gly Gly Thr Gly Gly Gly Phe Trp Gly Asp Arg Val Asp Ser 

35 40 45 

Gin Pro Phe Ala lie Pro Tyr lie His Pro Thr Asn Pro Phe Ala Pro 
50 55 60 

20 

Asp Val Ala Ala Ala Ser Gly Ser Gly Pro Arg Leu Arg Gin Pro Ala 
65 70 75 80 

Arg Pro Leu Gly Ser Thr Trp Arg Asp Gin Ala Gin Arg Pro Ser Ala 
25 85 90 95 

Ala Ser Arg Arg Arg Pro Ala Thr Ala Gly Ala Ala Ala Leu Thr Ala 
100 105 110 

3 0 Val Ala Pro Ala His Asp Thr Ser Pro Val Pro Asp Val Asp Ser Arg 

115 120 125 

Gly Ala lie Leu Arg Arg Gin Tyr Asn Leu Ser Thr Ser Pro Leu Thr 
130 135 140 



35 



Ser Ser Val Ala Ser Gly Thr Asn Leu Val Leu Tyr Ala Ala Pro Leu 
145 150 155 160 



40 



Asn Pro Pro Leu Pro Leu Gin Asp Gly Thr Asn Thr His lie Met Ala 
165 170 175 
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Thr Glu Ala Ser Asn Tyr Ala Gin Tyr Arg Val Ala Arg Ala Thr lie 
180 185 190 

Arg Tyr Arg Pro Leu Val Pro Asn Ala Val Gly Gly Tyr Ala lie Ser 
5 195 200 205 

lie Ser Phe Trp Pro Gin Thr Thr Thr Thr Pro Thr Ser Val Asp Met 
210 215 220 

10 Asn Ser lie Thr Ser Thr Asp Val Arg lie Leu Val Gin Pro Gly lie 

225 230 235 240 

Ala Ser Glu Leu Val lie Pro Ser Glu Arg Leu His Tyr Arg Asn Gin 
245 250 255 

15 

Gly Trp Arg Ser Val Glu Thr Ser Gly Val Ala Glu Glu Glu Ala Thr 
260 265 270 

Ser Gly Leu Val Met Leu Cys lie His Gly Ser Pro Val Ksn Ser Tyr 
20 275 280 285 

Thr Asn Thr Pro Tyr Thr Gly Ala Leu Gly Leu Leu Asp Phe Ala Leu 
290 295 300 

25 Glu Leu Glu Phe Arg Asn Leu Thr Thr Cys Asn Thr Asn Thr Arg Val 

305 310 315 320 

Ser Arg Tyr Ser Ser Thr Ala Arg His Arg Leu Arg Arg Gly Ala Asp 
325 330 335 

30 

Gly Thr Ala Glu Leu Thr Thr Thr Ala Ala Thr Arg Phe Met Lys Asp 
340 345 350 

Leu His Phe Thr Gly Leu Asn Gly Val Gly Glu Val Gly Arg Gly He 
35 355 360 365 

Ala Leu Thr Leu Leu Asn Leu Ala Asp Thr Leu Leu Gly Gly Leu Pro 
370 375 380 



40 



Thr Glu Leu He Ser Ser Ala Gly Gly Gin Leu Phe Tyr Ser Arg Pro 
385 390 395 400 
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10 



15 



Val Val Ser Ala Asn Gly Glu Pro Thr Val Lys Leu Tyr Thr Ser Val 
405 410 415 

Glu Asn Ala Gin Gin Asp Lys Gly Val Ala lie Pro His Asp He Asp 
420 425 430 

Leu Gly Asp Ser Arg Val Val He Gin Asp Tyr Asp Asn Gin His Glu 
435 440 445 

Gin Asp Arg Pro Thr Pro Ser Pro Ala Pro Ser Arg Pro Phe Ser Val 
450 455 460 

Leu Arg Ala Asn Asp Val Leu Trp Leu Ser Leu Thr Ala Ala Glu Tyr 
465 470 475 480 

Asp Gin Ser Thr Tyr Gly Ser Ser Thr Gly Pro Val Tyr He Ser Asp 
485 490 495 

Ser Val Thr Leu Val Asn Val Ala Thr Gly Ala Gin Ala Val Ala Arg 
20 500 505 510 

Ser Leu Asp Trp Ser Lys Val Thr Leu Asp Gly Arg Pro Leu Pro Thr 
515 520 525 

25 Val Glu Gin Tyr Ser Lys Thr Phe Phe Val Leu Pro Leu Arg Gly Lys 

530 535 540 

Leu Ser Phe Trp Glu Ala Gly Thr Thr Lys Ala Gly Tyr Pro Tyr Asn 
545 550 555 560 

30 

Tyr Asn Thr Thr Ala Ser Asp Gin He Leu He Glu Asn Ala Ala Gly 
565 570 575 

His Arg Val Ala He Ser Thr Tyr Thr Thr Arg Leu Gly Ala Gly Pro 
35 580 585 590 

Val Ala He Ser Ala Ala Ala Val Leu Ala Pro Arg Ser Ala Leu Ala 
595 600 605 
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Leu Leu Glu Asp Thr Phe Asp Tyr Pro Gly Arg Ala His Thr Phe Asp 
610 615 620 
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10 



15 



20 



U 25 



30 



35 



Asp Phe Cys Pro Glu Cys Arg Ala Leu Gly Leu Gin Gly Cys Ala Phe 
625 630 635 640 

Gin Ser Thr Val Ala Glu Leu Gin Arg Leu Lys Val Lys Val Gly Lys 
645 650 655 

Thr Arg Glu Leu 
660 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 549 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E virus (Burma strain) 
r62kDa, FIGURE 4 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Ala Val Ala Pro Ala His Asp Thr Pro Pro Val Pro Asp Val Asp Ser 
15 10 15 

Arg Gly Ala lie Leu Arg Arg Gin Tyr Asn Leu Ser Thr Ser Pro Leu 
20 25 30 

Thr Ser Ser Val Ala Thr Gly Thr Asn Leu Val Leu Tyr .Ala Ala Pro 
35 40 45 

Leu Ser Pro Leu Leu Pro Leu Gin Asp Gly Thr Asn Thr His lie Met 
50 55 60 

Ala Thr Glu Ala Ser Asn Tyr Ala Gin Tyr Arg Val Ala Arg Ala Thr 
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65 



70 



75 



80 



He Arg Tyr Arg Pro Leu Val Pro Asn Ala Val Gly Gly Tyr Ala lie 
85 90 95 

Ser He Ser Phe Trp Pro Gin Thr Thr Thr Thr Pro Thr Ser Val Asp 
100 105 no 

Met Asn Ser He Thr Ser Thr Asp Val Arg He Leu Val Gin Pro Gly 
115 120 125 

He Ala Ser Glu Leu Val He Pro Ser Glu Arg Leu His Tyr Arg Asn 
130 135 140 

Gin Gly Trp Arg Ser Val Glu Thr Ser Gly Val Ala Glu Glu Glu Ala 
145 150 155 160 

Thr Ser Gly Leu Val Met Leu Cys He His Gly Ser Leu Val Asn Ser 
165 170 175 

Tyr Thr Asn Thr Pro Tyr Thr Gly Ala Leu Gly Leu Leu Asp Phe Ala 
180 185 190 

Leu Glu Leu Glu Phe Arg Asn Leu Thr Pro Gly Asn Thr Asn Thr Arg 
195 200 205 

Val Ser Arg Tyr Ser Ser Thr Ala Arg His Arg Leu Arg Arg Gly Ala 
210 215 220 

Asp Gly Thr Ala Glu Leu Thr Thr Thr Ala Ala Thr Arg Phe Met Lys 
225 230 235 240 

Asp Leu Tyr Phe Thr Ser Thr Asn Gly Val Gly Glu He Gly Arg Gly 
245 250 255 

He Ala Leu Thr Leu Phe Asn Leu Ala Asp Thr Leu Leu Gly Gly Leu 
260 265 270 



Pro Thr Glu Leu He Ser Ser Ala Gly Gly Gin Leu Phe Tyr Ser Arg 
275 280 285 
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Pro Val Val Ser Ala Asn Gly Glu Pro Thr Val Lys Leu Tyr Thr Ser 
290 295 300 

Val Glu Asn Ala Gin Gin Asp Lys Gly lie Ala lie Pro His Asp lie 
5 305 310 315 320 

Asp Leu Gly Glu Ser Arg Val Val lie Gin Asp Tyr Asp Asn Gin His 
325 330 335 

10 Glu Gin Asp Arg Pro Thr Pro Ser Pro Ala Pro Ser Arg Pro Phe Ser 

340 345 350 

Val Leu Arg Ala Asn Asp Val Leu Trp Leu Ser Leu Thr Ala Ala Glu 
355 360 365 

15 

Tyr Asp Gin Ser Thr Tyr Gly Ser Ser Thr Gly Pro Val Tyr Val Ser 
370 375 380 

Asp Ser Val Thr Leu Val Asn Val Ala Thr Gly Ala Gin Ala Val Ala 
20 385 390 395 400 

Arg Ser Leu Asp Trp Thr Lys Val Thr Leu Asp Gly Arg Pro Leu Ser 
405 410 415 

25 Thr lie Gin Gin Tyr Ser Lys Thr Phe Phe Val Leu Pro Leu Arg Gly 

420 425 430 

Lys Leu Ser Phe Trp Glu Ala Gly Thr Thr Lys Ala Gly Tyr Pro Tyr 
435 440 445 

30 

Asn Tyr Asn Thr Thr Ala Ser Asp Gin Leu Leu Val Glu Asn Ala Ala 
450 455 460 

Gly His Arg Val Ala lie Ser Thr Tyr Thr Thr Ser Leu Gly Ala Gly 
35 465 470 475 480 

Pro Val Ser lie Ser Ala Val Ala Val Leu Ala Pro His Ser Ala Leu 
485 490 495 



40 



Ala Leu Leu Glu Asp Thr Leu Asp Tyr Pro Ala Arg Ala His Thr Phe 
500 505 510 
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Asp Asp Phe Cys Pro Glu Cys Arg Pro Leu Gly Leu Gin Gly Cyg Ala 
515 520 525 

Phe Gin Ser Thr Val Ala Glu Leu Gin Arg Leu Lys Met Lys Val Gly 
530 535 540 

Lys Thr Arg Glu Leu 
545 

INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 549 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E virus (Mexico strain) 
r62kDa, FIGURE 4 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Ala Val Ala Pro Ala His Asp Thr Ser Pro Val Pro Asp Val Asp Ser 
15 10 15 

Arg Gly Ala lie Leu Arg Arg Gin Tyr Asn Leu Ser Thr Ser Pro Leu 
20 25 30 

Thr Ser Ser Val Ala Ser Gly Thr Asn Leu Val Leu Tyr Ala Ala Pro 
35 40 45 

Leu Asn Pro Pro Leu Pro Leu Gin Asp Gly Thr Asn Thr His lie Met 
50 55 60 

Ala Thr Glu Ala Ser Asn Tyr Ala Gin Tyr Arg Val Ala Arg Ala Thr 
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65 



70 



75 



80 



He Arg Tyr Arg Pro Leu Val Pro Asn Ala Val Gly Gly Tyr Ala He 

85 90 95 

Ser He Ser Phe Trp Pro Gin Thr Thr Thr Thr Pro Thr Ser Val Asp 

100 105 HQ 



10 



Met Asn Ser He Thr Ser Thr Asp Val Arg He Leu Val Gin Pro Gly 
115 120 125 



He Ala Ser Glu Leu Val He Pro Ser Glu Arg Leu His Tyr Arg Asn 
130 135 140 



15 



Gin Gly Trp Arg Ser Val Glu Thr Ser Gly Val Ala Glu Glu Glu Ala 
145 150 155 160 



20 



Thr Ser Gly Leu Val Met Leu Cys He His Gly Ser Pro Val Asn Ser 

165 170 175 

Tyr Thr Asn Thr Pro Tyr Thr Gly Ala Leu Gly Leu Leu Asp Phe Ala 

180 185 190 



25 



Leu Glu Leu Glu Phe Arg Asn Leu Thr Thr Cys Asn Thr Asn Thr Arg 
195 200 205 



Val Ser Arg Tyr Ser Ser Thr Ala Arg His Arg Leu Arg Arg Gly Ala 
210 215 220 



30 



Asp Gly Thr Ala Glu Leu Thr Thr Thr Ala Ala Thr Arg Phe Met Lys 
225 230 235 240 



35 



Asp Leu His Phe Thr Gly Leu Asn Gly Val Gly Glu Val Gly Arg Gly 
245 250 255 

He Ala Leu Thr Leu Leu Asn Leu Ala Asp Thr Leu Leu Gly Gly Leu 
260 265 270 



40 



Pro Thr Glu Leu He Ser Ser Ala Gly Gly Gin Leu Phe Tyr Ser Arg 
275 280 285 
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Pro Val Val Ser Ala Asn Gly Glu Pro Thr Val Lys Leu Tyr Thr Ser 
290 295 300 

Val Glu Asn Ala Gin Gin Asp Lys Gly Val Ala He Pro His Asp lie 
305 310 315 320 

Asp Leu Gly Asp Ser Arg Val Val He Gin Asp Tyr Asp Asn Gin His 
325 330 335 



10 



Glu Gin Asp Arg Pro Thr Pro Ser Pro Ala Pro Ser Arg Pro Phe Ser 
340 345 350 



Val Leu Arg Ala Asn Asp Val Leu Trp Leu Ser Leu Thr Ala Ala Glu 
355 360 365 

15 

Tyr Asp Gin Ser Thr Tyr Gly Ser Ser Thr Gly Pro Val Tyr He Ser 
370 375 380 



Asp Ser Val Thr Leu Val Asn Val Ala Thr Gly Ala Gin Ala Val Ala 
38 5 390 395 400 

Arg Ser Leu Asp Trp Ser Lys Val Thr Leu Asp Gly Arg Pro Leu Pro 
405 410 415 

Thr Val Glu Gin Tyr Ser Lys Thr Phe Phe Val Leu Pro Leu Arg Gly 
420 425 430 



30 



Lys Leu Ser Phe Trp Glu Ala Gly Thr Thr Lys Ala Gly Tyr Pro Tyr 
435 440 445 

Asn Tyr Asn Thr Thr Ala Ser Asp Gin He Leu He Glu Asn Ala Ala 
450 455 460 



Gly His Arg Val Ala He Ser Thr Tyr Thr Thr Arg Leu Gly Ala Gly 
465 470 475 480 

Pro Val Ala He Ser Ala Ala Ala Val Leu Ala Pro Arg Ser Ala Leu 
485 490 495 

Ala Leu Leu Glu Asp Thr Phe Asp Tyr Pro Gly Arg Ala His Thr Phe 

500 505 510 
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Asp Asp Phe Cys Pro Glu Cys Arg Ala Leu Gly Leu Gin Gly Cys Ala 
515 520 525 

Phe Gin Ser Thr Val Ala Glu Leu Gin Arg Leu Lys Val Lys Val Gly 
5 530 535 540 

Lys Thr Arg Glu Leu 
545 

10 (2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 327 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 



20 



25 



30 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E Virus (Burma strain) SG3 
region 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Gly Ala Asp Gly Thr Ala Glu Leu Thr Thr Thr Ala Ala Thr Arg Phe 
15 10 15 

Met Lys Asp Leu Tyr Phe Thr Ser Thr Asn Gly Val Gly Glu lie Gly 
20 25 30 

Arg Gly lie Ala Leu Thr Leu Phe Asn Leu Ala Asp Thr Leu Leu Gly 
35 35 40 45 

Gly Leu Pro Thr Glu Leu lie Ser Ser Ala Gly Gly Gin Leu Phe Tyr 
50 55 60 



40 



Ser Arg Pro Val Val Ser Ala Asn Gly Glu Pro Thr Val Lys Leu Tyr 
65 70 75 80 
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Thr Ser Val Glu Asn Ala Gin Gin Asp Lys Gly lie Ala lie Pro His 
85 90 95 

Asp lie Asp Leu Gly Glu Ser Arg Val Val lie Gin Asp Tyr Asp Asn 
5 100 105 110 

Gin His Glu Gin Asp Arg Pro Thr Pro Ser Pro Ala Pro Ser Arg Pro 
115 120 125 

10 Phe Ser Val Leu Arg Ala Asn Asp Val Leu Trp Leu Ser Leu Thr Ala 

130 135 140 

Ala Glu Tyr Asp Gin Ser Thr Tyr Gly Ser Ser Thr Gly Pro Val Tyr 
145 150 155 160 

15 

Val Ser Asp Ser Val Thr Leu Val Asn Val Ala Thr Gly Ala Gin Ala 
165 170 175 

Val Ala Arg Ser Leu Asp Trp Thr Lys Val Thr Leu Asp Gly Arg Pro 
20 180 185 190 

Leu Ser Thr lie Gin Gin Tyr Ser Lys Thr Phe Phe Val Leu Pro Leu 
195 200 205 

25 Arg Gly Lys Leu Ser Phe Trp Glu Ala Gly Thr Thr Lys Ala Gly Tyr 

210 215 220 

Pro Tyr Asn Tyr Asn Thr Thr Ala Ser Asp Gin Leu Leu Val Glu Asn 
225 230 235 240 



30 



Ala Ala Gly His Arg Val Ala lie Ser Thr Tyr Thr Thr Ser Leu Gly 
245 250 255 



Ala Gly Pro Val Ser He Ser Ala Val Ala Val Leu Ala Pro His Ser 
35 260 265 270 

Ala Leu Ala Leu Leu Glu Asp Thr Leu Asp Tyr Pro Ala Arg Ala His 
275 280 285 



40 



Thr Phe Asp Asp Phe Cys Pro Glu Cys Arg Pro Leu Gly Leu Gin Gly 
290 295 300 
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Cys Ala Phe Gin Ser Thr Val Ala Glu Leu Gin Arg Leu Lys Met Lys 
305 310 315 320 

Val Gly Lys Thr Arg Glu Leu 
325 

) INFORMATION FOR SEQ ID NO: 18: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 327 amino acids 

(B) TYPE: amino acid 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E Virus (Mexico strain) 

SG3 region 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Gly Ala Asp Gly Thr Ala Glu Leu Thr Thr Thr Ala Ala Thr Arg Phe 
15 10 is 

Met Lys Asp Leu His Phe Thr Gly Leu Asn Gly Val Gly Glu Val Gly 
20 25 30 

Arg Gly He Ala Leu Thr Leu Leu Asn Leu Ala Asp Thr Leu Leu Gly 
35 40 45 

Gly Leu Pro Thr Glu Leu He Ser Ser Ala Gly Gly Gin Leu Phe Tyr 
50 55 60 

Ser Arg Pro Val Val Ser Ala Asn Gly Glu Pro Thr Val Lys Leu Tyr 
65 70 75 80 

Thr Ser Val Glu Asn Ala Gin Gin Asp Lys Gly Val Ala He Pro His 
85 90 95 
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10 



15 



Asp lie Asp Leu Gly Asp Ser Arg Val Val lie Gin Asp Tyr Asp Asn 
100 105 110 

Gin His Glu Gin Asp Arg Pro Thr Pro Ser Pro Ala Pro Ser Arg Pro 
115 120 125 

Phe Ser Val Leu Arg Ala Asn Asp Val Leu Trp Leu Ser Leu Thr Ala 
130 135 140 

Ala Glu Tyr Asp Gin Ser Thr Tyr Gly Ser Ser Thr Gly Pro Val Tyr 
145 150 155 160 

lie Ser Asp Ser Val Thr Leu Val Asn Val Ala Thr Gly Ala Gin Ala 
165 170 175 

Val Ala Arg Ser Leu Asp Trp Ser Lys Val Thr Leu Asp Gly Arg Pro 
180 185 190 

Leu Pro Thr Val Glu Gin Tyr Ser Lys Thr Phe Phe Val Leu Pro Leu 
20 195 200 205 

Arg Gly Lys Leu Ser Phe Trp Glu Ala Gly Thr Thr Lys Ala Gly Tyr 
210 215 220 

25 p ro Tyr Asn Tyr Asn Thr Thr Ala Ser Asp Gin He Leu He Glu Asn 

225 230 235 240 

Ala Ala Gly His Arg Val Ala He Ser Thr Tyr Thr Thr Arg Leu Gly 
245 250 255 

30 

Ala Gly Pro Val Ala He Ser Ala Ala Ala Val Leu Ala Pro Arg Ser 
260 265 270 

Ala Leu Ala Leu Leu Glu Asp Thr Phe Asp Tyr Pro Gly Arg Ala His 
35 275 280 285 

Thr Phe Asp Asp Phe Cys Pro Glu Cys Arg Ala Leu Gly Leu Gin Gly 
290 295 300 



40 



Cys Ala Phe Gin Ser Thr Val Ala Glu Leu Gin Arg Leu Lys Val Lys 
305 310 315 320 
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Val Gly Lys Thr Arg Glu Leu 
325 

(2) INFORMATION FOR SEQ ID NO: 19: 

5 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

10 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 

15 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E Virus (Burma strain) 

406.4-2 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Ala Asn Pro Pro Asp His Ser Ala Pro Leu Gly Val Thr Arg Pro Ser 
15 10 15 

25 Ala Pro Pro Leu Pro His Val Val Asp Leu Pro Gin Leu Gly Pro Arg 

20 25 30 

Arg 



30 



(2) INFORMATION FOR SEQ ID NO: 20: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 
35 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



40 (iii) HYPOTHETICAL: NO 
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10 



15 



25 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E Virus {Mexico strain) 
406.4-2 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Ala Asn Gin Pro Gly His Leu Ala Pro Leu Gly Glu lie Arg Pro Ser 
15 10 15 

Ala Pro Pro Leu Pro Pro Val Ala Asp Leu Pro Gin Pro Gly Leu Arg 
20 25 30 



Arg 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
2 0 (A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E Virus (Burma Strain) 
3 0 ORF-3 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

35 Met Asn Asn Met Ser Phe Ala Ala Pro Met Gly Ser Arg Pro Cys Ala 

15 10 15 

Leu Gly Leu Phe Cys Cys Cys Ser Ser Cys Phe Cys Leu Cys Cys Pro 
20 25 30 

40 



Arg His Arg Pro Val Ser Arg Leu Ala Ala Val Val Gly Gly Ala Ala 
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35 40 45 

Ala Val Pro Ala Val Val Ser Gly Val Thr Gly Leu lie Leu Ser Pro 
50 55 60 

5 

Ser Gin Ser Pro lie Phe lie Gin Pro Thr Pro Ser Pro Pro Met Ser 
65 70 75 80 

Pro Leu Arg Pro Gly Leu Asp Leu Val Phe Ala Asn Pro Pro Asp His 
10 85 90 95 

Ser Ala Pro Leu Gly Val Thr Arg Pro Ser Ala Pro Pro Leu Pro His 
100 105 110 

15 Val Val Asp Leu Pro Gin Leu Gly Pro Arg Arg Glx 

115 120 

(2) INFORMATION FOR SEQ ID NO: 22: 

2 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

2 5 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

3 0 (C) INDIVIDUAL ISOLATE: Hepatitis E Virus (Mexico Strain) 

ORF-3 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Met Asn Asn Met Trp Phe Ala Ala Pro Met Gly Ser Pro Pro Cys Ala 
15 10 15 



Leu Gly Leu Phe Cys Cys Cys Ser Ser Cys Phe Cys Leu Cys Cys Pro 
40 20 25 30 
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10 



15 



20 



Arg His Arg Pro Val Ser Arg Leu Ala Ala Val Val Gly Gly Ala Ala 
35 40 45 

Ala Val Pro Ala Val Val Ser Gly Val Thr Gly Leu lie Leu Ser Pro 
50 55 60 

Ser Gin Ser Pro lie Phe lie Gin Pro Thr Pro Leu Pro Gin Thr Leu 
65 70 75 80 

Pro Leu Arg Pro Gly Leu Asp Leu Ala Phe Ala Asn Gin Pro Gly His 
85 90 95 

Leu Ala Pro Leu Gly Glu lie Arg Pro Ser Ala Pro Pro Leu Pro Pro 
100 105 110 

Val Ala Asp Leu Pro Gin Pro Gly Leu Arg Arg Glx 
115 120 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

30 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E Virus (Burma strain) 

406.3-2 



35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Thr Leu Asp Tyr Pro Ala Arg Ala His Thr Phe Asp Asp Phe Cys Pro 
15 10 15 

4 0 Glu Cys Arg Pro Leu Gly Leu Gin Gly Cys Ala Phe Gin Ser Thr Val 

20 25 30 



25 
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Ala Glu Leu Gin Arg Leu Lys Met Lys Val Gly Lys Thr Arg Glu Leu 
35 40 45 



5 (2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 



15 



20 



25 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E Virus (Mexico stirain) 
406.3-2 region 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Thr Phe Asp Tyr Pro Gly Arg Ala His Thr Phe Asp Asp Phe Cys Pro 
15 10 15 

Glu Cys Arg Ala Leu Gly Leu Gin Gly Cys Ala Phe Gin Ser Thr Val 
20 25 30 

Ala Glu Leu Gin Arg Leu Lys Val Lys Val Gly Lys Thr Arg Glu Leu 
30 35 40 45 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 540 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



40 



(ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: NO 
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(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E virus (Burma strain) 
5 r62kDa, 58.1 kDa 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

10 Ala Val Ala Pro Ala His Asp Thr Pro Pro Val Pro Asp Val Asp Ser 

15 10 15 

Arg Gly Ala lie Leu Arg Arg Gin Tyr Asn Leu Ser Thr Ser Pro Leu 
20 25 30 



15 



Thr Ser Ser Val Ala Thr Gly Thr Asn Leu Val Leu Tyr Ala Ala Pro 
35 40 45 

Leu Ser Pro Leu Leu Pro Leu Gin Asp Gly Thr Asn Thr His lie Met 
20 50 55 60 

Ala Thr Glu Ala Ser Asn Tyr Ala Gin Tyr Arg Val Ala Arg Ala Thr 
65 70 75 80 

25 lie Arg Tyr Arg Pro Leu Val Pro Asn Ala Val Gly Gly Tyr Ala lie 

85 90 95 

Ser lie Ser Phe Trp Pro Gin Thr Thr Thr Thr Pro Thr Ser Val Asp 
100 105 110 

30 

Met Asn Ser lie Thr Ser Thr Asp Val Arg lie Leu Val Gin Pro Gly 
115 120 125 

lie Ala Ser Glu Leu Val lie Pro Ser Glu Arg Leu His Tyr Arg Asn 
35 130 135 140 

Gin Gly Trp Arg Ser Val Glu Thr Ser Gly Val Ala Glu Glu Glu Ala 
145 150 155 160 



40 



Thr Ser Gly Leu Val Met Leu Cys lie His Gly Ser Leu Val Asn Ser 
165 170 175 
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Tyr Thr Asn Thr Pro Tyr Thr Gly Ala Leu Gly Leu Leu Asp Phe Ala 
180 185 190 

Leu Glu Leu Glu Phe Arg Asn Leu Thr Pro Gly Asn Thr Asn Thr Arg 
195 200 205 

Val Ser Arg Tyr Ser Ser Thr Ala Arg His Arg Leu Arg Arg Gly Ala 
210 215 220 



10 



Asp Gly Thr Ala Glu Leu Thr Thr Thr Ala Ala Thr Arg Phe Met Lys 
225 230 235 240 



15 



Asp Leu Tyr Phe Thr Ser Thr Asn Gly Val Gly Glu lie Gly Arg Gly 
245 250 255 

lie Ala Leu Thr Leu Phe Asn Leu Ala Asp Thr Leu Leu Gly Gly Leu 
260 265 270 



20 



Pro Thr Glu Leu lie Ser Ser Ala Gly Gly Gin Leu Phe Tyr Ser Arg 
275 280 285 



Pro Val Val Ser Ala Asn Gly Glu Pro Thr Val Lys Leu Tyr Thr Ser 
290 295 300 



25 



Val Glu Asn Ala Gin Gin Asp Lys Gly lie Ala lie Pro His Asp lie 
305 310 315 320 



30 



Asp Leu Gly Glu Ser Arg Val Val lie Gin Asp Tyr Asp Asn Gin His 

325 330 335 

Glu Gin Asp Arg Pro Thr Pro Ser Pro Ala Pro Ser Arg Pro Phe Ser 

340 345 350 



35 



Val Leu Arg Ala Asn Asp Val Leu Trp Leu Ser Leu Thr Ala Ala Glu 
355 360 365 



Tyr Asp Gin Ser Thr Tyr Gly Ser Ser Thr Gly Pro Val Tyr Val Ser 
370 375 380 



40 



Asp Ser Val Thr Leu Val Asn Val Ala Thr Gly Ala Gin Ala Val Ala 
385 390 395 400 
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Arg Ser Leu Asp Trp Thr Lys Val Thr Leu Asp Gly Arg Pro Leu Ser 
405 410 415 

Thr lie Gin Gin Tyr Ser Lys Thr Phe Phe Val Leu Pro Leu Arg Gly 
420 425 430 

Lys Leu Ser Phe Trp Glu Ala Gly Thr Thr Lys Ala Gly Tyr Pro Tyr 
435 440 445 

Asn Tyr Asn Thr Thr Ala Ser Asp Gin Leu Leu Val Glu Asn Ala Ala 
450 455 460 



Gly His Arg Val Ala lie Ser Thr 
465 470 

Pro Val Ser lie Ser Ala Val Ala 
485 

Ala Leu Leu Glu Asp Thr Leu Asp 
500 



Tyr Thr Thr Ser Leu Gly Ala Gly 
475 480 

Val Leu Ala Pro His Ser Ala Leu 
490 495 

Tyr Pro Ala Arg Ala His Thr Phe 
505 510 



Asp Asp Phe Cys Pro Glu Cys Arg Pro Leu Gly Leu Gin Gly Cys Ala 
515 520 525 



Phe Gin Ser Thr Val Ala Glu Leu Gin Arg Leu Lys 
530' 535 540 



) INFORMATION FOR SEQ ID NO: 26: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 540 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: NO 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E virus (Mexico strain) 
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r62kDa, 58.1 kDa 



10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Ala Val Ala Pro Ala His Asp Thr Ser Pro Val Pro Asp Val Asp Ser 
15 10 15 

Arg Gly Ala lie Leu Arg Arg Gin Tyr Asn Leu Ser Thr Ser Pro Leu 
20 25 30 

Thr Ser Ser Val Ala Ser Gly Thr Asn Leu Val Leu Tyr Ala Ala Pro 
35 40 45 

Leu Asn Pro Pro Leu Pro Leu Gin Asp Gly Thr Asn Thr His lie Met 
15 50 55 60 

Ala Thr Glu Ala Ser Asn Tyr Ala Gin Tyr Arg Val Ala Arg Ala Thr 
65 70 75 80 

2 0 lie Arg Tyr Arg Pro Leu Val Pro Asn Ala Val Gly Gly Tyr Ala lie 

85 90 95 

Ser lie Ser Phe Trp Pro Gin Thr Thr Thr Thr Pro Thr Ser Val Asp 
100 105 110 

25 

Met Asn Ser lie Thr Ser Thr Asp Val Arg lie Leu Val Gin Pro Gly 
115 120 125 

lie Ala Ser Glu Leu Val lie Pro Ser Glu Arg Leu His Tyr Arg Asn 

3 0 130 135 140 

Gin Gly Trp Arg Ser Val Glu Thr Ser Gly Val ALa Glu Glu Glu Ala 
145 150 155 160 

3 5 Thr Ser Gly Leu Val Met Leu Cys He His Gly Ser Pro Val Asn Ser 

165 170 175 

Tyr Thr Asn Thr Pro Tyr Thr Gly Ala Leu Gly Leu Leu Asp Phe Ala 
180 185 190 

40 



Leu Glu Leu Glu Phe Arg Asn Leu Thr Thr Cys Asn Thr Asn Thr Arg 
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195 



200 



205 



Val Ser Arg Tyr Ser Ser Thr Ala Arg His Arg Leu Arg Arg Gly Ala 
210 215 220 

Asp Gly Thr Ala Glu Leu Thr Thr Thr Ala Ala Thr Arg Phe Met Lys 
225 230 235 240 



10 



Asp Leu His Phe Thr Gly Leu Asn Gly Val Gly Glu Val Gly Arg Gly 
245 250 255 



He Ala Leu Thr Leu Leu Asn Leu Ala Asp Thr Leu Leu Gly Gly Leu 
260 265 270 



15 



Pro Thr Glu Leu He Ser Ser Ala Gly Gly Gin Leu Phe Tyr Ser Arg 
2 75 280 285 



20 



Pro Val Val Ser Ala Asn Gly Glu Pro Thr Val Lys Leu Tyr Thr Ser 

290 295 300 

Val Glu Asn Ala Gin Gin Asp Lys Gly Val Ala He Pro His Asp He 

305 310 315 320 



25 



Asp Leu Gly Asp Ser Arg Val Val He Gin Asp Tyr Asp Asn Gin His 
325 330 335 



Glu Gin Asp Arg Pro Thr Pro Ser Pro Ala Pro Ser Arg Pro Phe Ser 
340 345 350 



30 



Val Leu Arg Ala Asn Asp Val Leu Trp Leu Ser Leu Thr Ala Ala Glu 
355 360 365 



35 



Tyr Asp Gin Ser Thr Tyr Gly Ser Ser Thr Gly Pro Val Tyr He Ser 
370 375 380 

Asp Ser Val Thr Leu Val Asn Val Ala Thr Gly Ala Gin Ala Val Ala 
3 85 390 395 400 



40 



Arg Ser Leu Asp Trp Ser Lys Val Thr Leu Asp Gly Arg Pro Leu Pro 
405 410 415 
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Thr Val Glu Gin Tyr Ser Lys Thr Phe Phe Val Leu Pro Leu Arg Gly 
420 425 430 

Lys Leu Ser Phe Trp Glu Ala Gly Thr Thr Lys Ala Gly Tyr Pro Tyr 
435 440 445 

Asn Tyr Asn Thr Thr Ala Ser Asp Gin lie Leu lie Glu Asn Ala Ala 
450 455 460 

Gly His Arg Val Ala lie Ser Thr Tyr Thr Thr Arg Leu Gly Ala Gly 
465 470 475 480 

Pro Val Ala lie Ser Ala Ala Ala Val Leu Ala Pro Arg Ser Ala Leu 
485 490 495 

Ala Leu Leu Glu Asp Thr Phe Asp Tyr Pro Gly Arg Ala His Thr Phe 
500 505 510 

Asp Asp Phe Cys Pro Glu Cys Arg Ala Leu Gly Leu Gin Gly Cys Ala 
515 520 525 

Phe Gin Ser Thr Val Ala Glu Leu Gin Arg Leu Lys 
530 535 540 

) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 525 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE : Hepatitis E virus (Burma strain) 

r62kDa, 56.5 kDa 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

Ala Val Ala Pro Ala His Asp Thr Pro Pro Val Pro Asp Val Asp Ser 
15 10 15 

5 

Arg Gly Ala lie Leu Arg Arg Gin Tyr Asn Leu Ser Thr Ser Pro Leu 
20 25 30 

Thr Ser Ser Val Ala Thr Gly Thr Asn Leu Val Leu Tyr Ala Ala Pro 
10 35 40 45 

Leu Ser Pro Leu Leu Pro Leu Gin Asp Gly Thr Asn Thr His lie Met 
50 55 60 

15 Ma Thr Glu Ala Ser Asn Tyr Ala Gin Tyr Arg Val Ala Arg Ala Thr 

65 70 75 80 



20 



lie Arg Tyr Arg Pro Leu Val Pro Asn Ala Val Gly Gly Tyr Ala lie 
85 90 95 

Ser lie Ser Phe Trp Pro Gin Thr Thr Thr Thr Pro Thr Ser Val Asp 
100 105 110 

Met Asn Ser lie Thr Ser Thr Asp Val Arg lie Leu Val Gin Pro Gly 
25 115 120 125 

lie Ala Ser Glu Leu Val lie Pro Ser Glu Arg Leu His Tyr Arg Asn 
130 135 140 

3 0 Gin Gly Trp Arg Ser Val Glu Thr Ser Gly Val Ala Glu Glu Glu Ala 

145 150 155 160 



35 



Thr Ser Gly Leu Val Met Leu Cys He His Gly Ser Leu Val Asn Ser 
165 170 175 

Tyr Thr Asn Thr Pro Tyr Thr Gly Ala Leu Gly Leu Leu Asp Phe Ala 
180 185 190 



40 



Leu Glu Leu Glu Phe Arg Asn Leu Thr Pro Gly Asn Thr Asn Thr Arg 
195 200 205 
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10 



15 



Val Ser Arg Tyr Ser Ser Thr Ala Arg His Arg Leu Arg Arg Gly Ala 
210 215 220 

Asp Gly Thr Ala Glu Leu Thr Thr Thr Ala Ala Thr Arg Phe Met Lys 
225 230 235 240 

Asp Leu Tyr Phe Thr Ser Thr Asn Gly Val Gly Glu He Gly Arg Gly 
245 250 255 

He Ala Leu Thr Leu Phe Asn Leu Ala Asp Thr Leu Leu Gly Gly Leu 
260 265 270 

Pro Thr Glu Leu He Ser Ser Ala Gly Gly Gin Leu Phe Tyr Ser Arg 
275 280 285 

Pro Val Val Ser Ala Asn Gly Glu Pro Thr Val Lys Leu Tyr Thr Ser 
290 295 300 

Val Glu Asn Ala Gin Gin Asp Lys Gly He Ala He Pro His Asp He 
20 305 310 315 320 

Asp Leu Gly Glu Ser Arg Val Val He Gin Asp Tyr Asp Asn Gin His 
325 330 335 

25 Gin Asp Arg Pro Thr Pro Ser Pro Ala Pro Ser Arg Pro Phe Ser 

340 345 350 

Val Leu Arg Ala Asn Asp Val Leu Trp Leu Ser Leu Thr Ala Ala Glu 
355 360 365 

30 

Tyr Asp Gin Ser Thr Tyr Gly Ser Ser Thr Gly Pro Val Tyr Val Ser 
370 375 380 

Asp Ser Val Thr Leu Val Asn Val Ala Thr Gly Ala Gin Ala Val Ala 
35 385 390 395 400 

Arg Ser Leu Asp Trp Thr Lys Val Thr Leu Asp Gly Arg Pro Leu Ser 
405 410 415 



40 



Thr He Gin Gin Tyr Ser Lys Thr Phe Phe Val Leu Pro Leu Arg Gly 
420 425 430 
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Lys Leu Ser Phe Trp Glu Ala Gly Thr Thr Lys Ala Gly Tyr Pro Tyr 
435 440 445 

Asn Tyr Asn Thr Thr Ala Ser Asp Gin Leu Leu Val Glu Asn Ala Ala 
5 450 455 460 

Gly His Arg Val Ala lie Ser Thr Tyr Thr Thr Ser Leu Gly Ala Gly 
465 470 475 480 

10 Pro Val Ser lie Ser Ala Val Ala Val Leu Ala Pro His Ser Ala Leu 

485 490 495 

Ala Leu Leu Glu Asp Thr Leu Asp Tyr Pro Ala Arg Ala His Thr Phe 
500 505 510 



15 



20 



30 



35 



Asp Asp Phe Cys Pro Glu Cys Arg Pro Leu Gly Leu Gin 
515 520 525 

(2) INFORMATION FOR SEQ ID NO: 28: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 525 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Hepatitis E virus {Mexico strain) 
r62kDa, 56.5 kDa 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Ala Val Ala Pro Ala His Asp Thr Ser Pro Val Pro Asp Val Asp Ser 
15 10 15 



40 



Arg Gly Ala lie Leu Arg Arg Gin Tyr Asn Leu Ser Thr Ser Pro Leu 
20 25 30 
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Thr Ser Ser Val Ala Ser Gly Thr Asn Leu Val Leu Tyr Ala Ala Pro 
35 40 45 

Leu Asn Pro Pro Leu Pro Leu Gin Asp Gly Thr Asn Thr His lie Met 
5 50 55 60 

Ala Thr Glu Ala Ser Asn Tyr Ala Gin Tyr Arg Val Ala Arg Ala Thr 
65 70 75 80 

10 He Arg Tyr Arg Pro Leu Val Pro Asn Ala Val Gly Gly Tyr Ala He 

85 90 95 

Ser lie Ser Phe Trp Pro Gin Thr Thr Thr Thr Pro Thr Ser Val Asp 
100 105 110 

15 

Met Asn Ser He Thr Ser Thr Asp Val Arg He Leu Val Gin Pro Gly 
115 120 125 

lie Ala Ser Glu Leu Val lie Pro Ser Glu Arg Leu His Tyr Arg Asn 
20 130 135 140 

Gin Gly Trp Arg Ser Val Glu Thr Ser Gly Val Ala Glu Glu Glu Ala 
145 150 155 160 

2 5 Thr Ser Gly Leu Val Met Leu Cys lie His Gly Ser Pro Val Asn Ser 

165 170 175 

Tyr Thr Asn Thr Pro Tyr Thr Gly Ala Leu Gly Leu Leu Asp Phe Ala 
180 185 190 

30 

Leu Glu Leu Glu Phe Arg Asn Leu Thr Thr Cys Asn Thr Asn Thr Arg 
195 200 205 

Val Ser Arg Tyr Ser Ser Thr Ala Arg His Arg Leu Arg Arg Gly Ala 
35 210 215 220 

Asp Gly Thr Ala Glu Leu Thr Thr Thr Ala Ala Thr Arg Phe Met Lys 
225 230 235 240 



40 



Asp Leu His Phe Thr Gly Leu Asn Gly Val Gly Glu Val Gly Arg Gly 
245 250 255 
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lie Ala Leu Thr Leu Leu Asn Leu Ala Asp Thr Leu Leu Gly Gly Leu 
260 265 270 

Pro Thr Glu Leu lie Ser Ser Ala Gly Gly Gin Leu Phe Tyr Ser Arg 
275 280 285 

Pro Val Val Ser Ala Asn Gly Glu Pro Thr Val Lys Leu Tyr Thr Ser 
290 295 300 

Val Glu Asn Ala Gin Gin Asp Lys Gly Val Ala lie Pro His Asp lie 
305 310 315 320 

Asp Leu Gly Asp Ser Arg Val Val He Gin Asp Tyr Asp Asn Gin His 
325 330 335 

Glu Gin Asp Arg Pro Thr Pro Ser Pro Ala Pro Ser Arg Pro Phe Ser 
340 345 350 

Val Leu Arg Ala Asn Asp Val Leu Trp Leu Ser Leu Thr Ala Ala Glu 
355 360 365 

Tyr Asp Gin Ser Thr Tyr Gly Ser Ser Thr Gly Pro Val Tyr He Ser 
370 375 380 

Asp Ser Val Thr Leu Val Asn Val Ala Thr Gly Ala Gin Ala Val Ala 
385 390 395 400 

Arg Ser Leu Asp Trp Ser Lys Val Thr Leu Asp Gly Arg Pro Leu Pro 
405 410 415 

Thr Val Glu Gin Tyr Ser Lys Thr Phe Phe Val Leu Pro Leu Arg Gly 
420 425 430 

Lys Leu Ser Phe Trp Glu Ala Gly Thr Thr Lys Ala Gly Tyr Pro Tyr 
435 440 445 

Asn Tyr Asn Thr Thr Ala Ser Asp Gin He Leu He Glu Asn Ala Ala 
450 455 460 

Gly His Arg Val Ala He Ser Thr Tyr Thr Thr Arg Leu Gly Ala Gly 
46 5 470 475 480 
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Pro Val Ala He Ser Ala Ala Ala Val Leu Ala Pro Arg Ser Ala Leu 
485 490 495 

Ala Leu Leu Glu Asp Thr Phe Asp Tyr Pro Gly Arg Ala His Thr Phe 
5 500 505 510 

Asp Asp Phe Cys Pro Glu Cys Arg Ala Leu Gly Leu Gin 
515 520 525 

10 (2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
20 (iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: HEV 5' primer 



25 



30 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
GGGGGGGATC CATATGGCGG TCGCTCCGGC CCATGACACC CCG 43 

(2) INFORMATION FOR SEQ ID NO: 30: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
3 5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 
(iii) HYPOTHETICAL: NO 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: HEV 3' pri 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
ATTAGAAGCT TCCGTGGCCA TTATATG 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: internally consistent sequence of 

two HEV 406.4-2 antigens 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /note= "where Xaa is Q or P" 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /note= "where Xaa is G or D" 

( ix ) FEATURE : 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /note- "where Xaa is L or S" 
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(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /note= "where Xaa is E or V" 

5 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 13 

(D) OTHER INFORMATION: /note= "where Xaa is I or T" 

10 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 22 

(D) OTHER INFORMATION: /note= "where Xaa is P or H" 

15 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 
<B) LOCATION: 24 

(D) OTHER INFORMATION: /note= "where Xaa is A or V" 

20 

( ix ) FEATURE : 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 29 

(D) OTHER INFORMATION: /note= "where Xaa is P or L" 

25 

( ix ) FEATURE : 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 31 

(D) OTHER INFORMATION: /note= "where Xaa is L or P" 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Ala Asn Xaa Pro Xaa His Xaa Ala Pro Leu Gly Xaa Xaa Arg Pro Ser 
35 1 5 10 15 

Ala Pro Pro Leu Pro Xaa Val Xaa Asp Leu Pro Gin Xaa Gly Xaa Arg 
20 25 30 

4 0 Arg 



